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Ch-iTTi^-rig JDNA-bin^na/DNA Methvltransf erase Nucleic Acid and 
Polypeptide^and^Uses Thereof^, 
5 This application claims the benefit of U.S. Provisional 
Application No. 60/004,445, filed September 28, 1995, and 
this application is a continuation-in-part of U.S. Serial 
No. 08/594,866, filed January 31, 1996, A the contents of 
which are hereby incorporated by reference into the present 
10 application . 



Background of the Invention 

Throughout this application, various publications are 
15 referenced by author and date. Full citations for these 
publications may be found listed alphabetically at the end 
of the specification immediately preceding the claims. The 
disclosures of these publications in their entireties are 
hereby incorporated by reference into this application in 
2 0 order to mbjre fully describe the state of the art as known 
to those skilled therein as of the date of the invention 
described and .claimed herein. 

All -. mammalian, /promoters, tested to date have been found to be 
25 / silenced when they contain 5-methylcytosine (m 5 C) at CpG 
sites , (reviewed by Bestor , . .= 1990 ; Meehan et al., 1993). 
Methylation represses transcription directly. by interference 
with binding of transcription factors (Joel et al . , 1993, 
* and - references, . therein) and methylated sequences are 
30 ^assembled into condensed. chromatin that is inaccessible to 
transcription factors (reviewed by Bird, 1992) . It has been 
-shown that methylation of CpG dinucleotides in the 5' long 
terminal repeat . (LTR) of HIV-1 suppresses viral 
transcription (reviewed by Bednarik, 1992) . Methylation can 
35 reduce -transcription from the HIV-1 5' LTR to undetectable 
r . levels . ^.(Joel et al. , 1993) , It is , also known that, 
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methylation patterns are transmitted by clonal inheritance 
(Wigler, 1981), and that this is due to the strong 
preference of mammalian DNA methyltransf erase for 
hemimethylated DNA (Bestor and Ingram, 1983; Bestor, 1992) . 
5 The heritability of methylation patterns causes the affected 
promoters to be irreversibly inactivated, as has been 
observed for a large number of genes in cultured cells 
(reviewed by Holliday, 1993) and in human tissues (Yoshiura 
et al., 1995; Herman et al . , 1994). While the exact 
10 function of DNA methylation remains the subject of 
discussion, its ability to suppress transcription is not in 
doubt . 

Most exogenous nucleic acid sequences that a resident in a 
15 cell such as transposable elements and proviral DNA are 
methylated and inactivated in the genomes of mammals, 
flowering plants, and those fungi whose genomes contain m 5 C. 
Retroviral DNA is especially prone to inactivation by de 
novo methylation upon germline transmission, and this has 

2 0 greatly reduced the usefulness of retroviral transducing 

vectors in the construction of transgenic animals (Jahner 
and Jaenisch, 1984; Jaenisch et al . , 1985). Treatment of 
cultured cells or" mice with fche^demethy rating drug: 5- 
azacytidine (an irreversible - inhibitor of-^- \DNA 

25 methyltransf erases) can reactivate methylated retrovirus 
genomes (Jaenisch et al 1985) >■ and endogenous gdnes^that 
have been silenced by ectopic de novo methylation of 
promoter regions can also be reactivated by demethylation 
(Holliday, 1993; Yoshiura et al . , 1995).- It is notable that: 

30 organisms whose DNA lacks m 5 C (such as Droeophila) suffer 
far larger numbers of insertion mutations (Ashburner, 1992) , 
which may reflect a reduced capacity for the control of 
mobile elements. These observations, together with 

evolutionary considerations (Bestor, 1990) , strongly suggest- 

3 5 that cytosine methylation is part of a genomic host defense 
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system that limits the proliferation of parasitic sequence 
elements (Bestor, 1990; Bestor and Coxon, 1993) . The 
selective advantage of such a defensive system is obvious, 
given that a significant fraction of the genome represents 
5 exogenous sequences that are invisible to the immune system 
and which might impose a lethal mutagenic or cytotoxic 
burden if allowed to proliferate unchecked. 

The proviral DNA of many retroviruses is propagated in the 
10 repressed, latent state as a result of methylation of LTR 
sequences (reviewed by Jahner and Jaenisch, 1984) . A body 
of evidence indicates methylation can cause latency in cells 
infected with HTLV-1 (Cassens and Ullrich, 1993) or HIV-1 
(Bednarik et al., 1990; Joel et al . , 1993 reviewed by 
15 Bednarik, 1992) . The importance of methylation in latent 
HIV-1 infections in patients is unknown, but as HIV has only 
recently entered human populations, the host-parasite 
relationship may be far from equilibrium (Baltimore, 1995) 
and HIV proviral DNA may be inactivated with low efficiency 
20 under normal conditions (McCune, 1995) . It is clear, 
however, that HIV-1 transcription is very sensitive to 
methylation of 5' LTR sequences (reviewed by Bednarik, 
1992) . 
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Summary of the Invention 

The present invention provides a chimeric protein which 
5 comprises a mutated DNA methyltransf erase portion and a DNA 
binding protein portion that binds sufficiently close to a 
promoter sequence of a target gene, which promoter sequence 
contains a methylation site, to specifically methylate the 
site and inhibit activity of the promoter and thus inhibit 
0 expression of the target gene. This invention also provides 
for a method for inhibiting the expression of a target gene 
which includes contacting a promoter of the target gene with 
the chimeric protein, so as to specifically methylate the 
promoter sequence of the target gene thus inhibiting 
5 expression of the target gene . 
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Brief Description of the Figures 

Figure 1 ■ Distribution of CpG sites and transcription 
factor binding sites in the 5' LTR of HIV-1 (Sequence I.D. 

5 No. 1) . CpG sites that fall within the recognition sequence 
of a restriction methyl transferase or methylation-sensitive 
endonuclease are shown above downward-pointing arrows . Data 
are from Garcia and Gaynor, 1994. 

10 Figure 2 . Sequence of HIV-1 5' LTR. CpG dinucleo tides are 
bold and underlined; CpG sites that lie within the 
recognition sequence of methylation-sensitive restriction 
endonuclease are bold and italicized. Major transcription 
start site is denoted by +1, and the underlined sequence 

15 following denotes the transcribed sequence; numbering is 
with respect to the transcription start site. Please refer 
to Figure 1 for location of protein binding sites. Strike 
through indicates the 9 bp binding site for the 3 finger 
protein developed by Wu et al. (19951. Sequence and 



Figure 3 . Gene silencing via targeted DNA methylation. A 

chimera between a sequence-specific DNA binding protein and 
a DNA methyltransf erase with attenuated DNA binding directs 
25 methylation only to sites in the vicinity of the recognition^ 
sequence of the DNA binding protein: * — * 

Figure 4 . Domain organization 1 of mammalian DNA methyl- 
transf erase. This enzyme is comprised of a long N- terminal 
30 regulatory domain and a C- terminal catalytic domain that* 
closely resembles bacterial restriction methyltransf erase . 
See Bestor and Verdine (1994) for a recent review. 

Figure 5 . Protein-DNA contacts in the M. Haelll-DNA 
35 transition state intermediate. The target cytosine is 



20 



numbering from Garcia and Gaynor, 199 
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everted from the helix during methyl transfer, and is shown 
in the extra helical position. Thin dotted lines indicate 
base-specific contacts in the major groove, bold lines 
indicate contacts with the sugar-phosphate backbone . 
5 Mutation of residues involved in the sugar-phosphate 
contacts will reduce the affinity of DNA methyl -transferases 
for DNA, as will framework mutations that alter the 
disposition of the contact residues. Such mutations will 
make methyl transfer dependent on the DNA binding protein 
10 moiety of a DNA binding protein/DNA methyltransf erase 
chimera, as described herein. Data from Reinisch et al. 
(1995) . 

Figure 6 . pLS, an expression construct to be used in the 
15 selection of DNA binding protein/DNA methyl transferase 
chimeras that methylate predetermined sequences. Lex A was 

chosen for its ability to direct fused proteins to Lex A 
binding sites on DNA (Brent and Ptashne, 1986) , and M.SssI 
was chosen because of its high specific activity and the 

2 0 fact that it is the only bacterial methyltransf erase that 

has the same specificity (5'-CpG-3') (Renbaum and Razin, 
1992) as, the mammalian enzyme (Bestor and Ingram, 1983) . 

Figure 7. Cyclic ±n vitro/in vivo selection for DNA binding 
25 „ protein/DNA methyltransf erase chimeras in which methylation 
is dependent on the DNA binding protein moiety. McrBC en- 
codes a nuclease that degrades heavily methylated DNA. 
Unmethylated DNA that has been linearized by a methylation- 
sensitive endonuclease does not transform. Use of these 

3 0 selective procedures on combinatorial libraries allows 

selection of chimeric DNA binding protein-DNA 
methyltransf erase that target methylation only to a unique 
target site. This selection scheme may be applied to 
LexA/DNA methyltransf erase chimeras (Example 1) and to zinc 
3 5 finger/DNA methyltransf erase chimeras (Examples 3-5) . 
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Figure 8 . Identification of Cp6 sites in the HIV-1 5' LTR 
that yield maximal suppression of transcription when methy- 
lated. Individual 2 0mer primers that contain single m 5 CpG 
5 sites are hybridized to single -stranded M13 clones that 
contain the HIV-1 LTR and the bacterial CAT reporter gene. 
Extension of the primers with a DNA polymerase (sequenase) 
yields double- stranded DNA that is methylated on one strand 
at one CpG site. Upon transfection the hemimethylated site 
10 rapidly becomes methylated (Busslinger et al . , 1983), The 
positions of each of the 11 CpG sites in the HIV-1 5' LTR 
are given with respect to the transcription start site in 
the HIV-1 genome (Garcia and Gaynor, 1993) . 

15 Figure 9 . Phage- display selection of zinc finger 

proteins that bind to predetermined sequences in the HIV-1 
5' LTR. The strategy used to construct tridactyl zinc 
finger proteins that bind to specific sequences was 
independently developed by the laboratories of Berg 

20 (Desjarlais and Berg, 1993), Klug (Choo and Klug, 1994), 
Pabo (Rebar and Pabo, 1994), and Barbas (Wu et al . , 1994). 
The schemes are very similar; the one depicted here .is -that 
of Choo; and Klug (1994) . The method hasJbeen used- to. , obtain 
proteins that bind to a number .of -predetermined - sequences 

25 with high specificity and affinity.- The boxed sequence 
shown in the figure appears 2 0 base pairs '5 ' /of the ; first 
Hpall (CCGG) site in the LTR sequence * of Figure 2; . it is 
shown for illustrative purposes only (Sequence I.D. .No.,; 2) . 
The actual target sequence will be determined as described 

30 in Example 2. 

Figure 10. Organization and transcription of the HBV genome . 

The viral genome of 3 . 2 kb is represented by a horizontal 
line. Positions of methylatable CpG sites are shown by 
35 vertical lines. Arrows indicate major >viral transcripts . ; 
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Notice that transcription of each major mRNA initiates in a 
cluster of CpG sites. CpG methylation has been shown to 
suppress viral transcription in transgenic animals (Miller 
and Robinson, 1993; Pourcel et al . , 1990). 

Figure s 11A - 11B. Targeted methylation of a preselected CpG 
site. 

(A) A diagram of pLS which encodes a lexA-M.SssI fusion 
protexn. A Smal site is located adjacent ^t^a lexA binding 
site in the same plasmid. LexA is predicted to position 
M.SssI over this site, and preferential methylation of 
thoCmal^ site was expected (M.SssI has no intrinsic 
preference for Smal sites) . The positions of other 
restriction sites are shown approximately to scale; other 
features of the plasmid are omitted for the sake of clarity. 

(B) Data confirms targeted methylation. Odd-numbered lanes 
contain plasmid that encodes enzymatically active fusion 
protein. The unmethylated state^ oucee the DNA to be cleaved 
to the linear form. Notice that the Smal site is largely 
resistant to cleavage in lane 8 (compare to lanes 7 and 10) , 
while the other sites remain sensitive. The methylation- 
insensitive endonulease Ncol was used as a control. Heavy 

..methylation is observed only at the Smal site; little or no 
detectable methylation is seen at Sail and Xhol sites .(lanes 
;4 and 6) . Lin: linear unit-length DNA (unmethylated) . : CCC, 

rcovalently closed circular DNA (methylated in lane 88) . 
Lane 11 contains length markers. The diffuse band that 
migrates above the largest length marker is bacterial 
chromosomal DNA. 

Figure 12. Construction of a plasmid that performs targeted 
methylation. pLM9 was constructed by cloning ^rt A Lac 
repressor-M.SssI fusion gene between the Pvull sites of 
pBluescript. Adjacent to the fusion gene is a synthetic 
methylation target sequence , LacO-CpGs , which contains the 
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Lac operator (to which Lac repressor binds) and a number of 
CpG target sites. These are 12 amino acids in the linker 
between Lad and M.SssI. Expression of the fusion is 
controlled by the native promoter of IacI gene. 

5 

Figure 13. Expression of LacI-M.SssI fusion in E. coli XL1- 
Blue [pLM9] . The targeted methyltransf erase is marked with 
an arrow. The antibody used was anti-Lad. An intact 
fusion protein was produced in good yield. Lane marked pBS 
10 contained proteins extracted from cells that contained only 
the expression vector. 

Figure 14. Targeted methylation demonstrated by bisulfite 
P sequencing analysis. The methylation target shown in Figure 

p 15 12 was isolated from cells expressing Lacl-M. SssI fusion 
Lq proteins . The bisulfite sequencing method was used to 

fr identify methylated sites. pLM9-l and -2 are two sister 



(K 



bisulfite A 3©ne^L carrying BS-modif ied methylation target 



W sequence derived from a pLM9 mutant that has attenuated 

|\ 20 activity. Methylation is limited to CpG sites in * the 

P immediate vicinity of the binding site of the sequence 

5 specific .DNA. binding .protein Lad . pBS is. the precursor:, of 

jjj pLM9 and has no lacI/M.SssI gene; no methylated sifeesiare 

S3 present. pM is a pBS derivative that encodes fully active 

25 (non-targeted) M.SssI gene; all sites are . methylated... :^These 
;data confirm that targeted methylation has. been .t-used^ to 



direct methylation to CpG .sites in the . vicinxny; -pt^-cne 
binding site of a sequence-specific DNA binding prot^xn^ 



30 Figure 15. Zinc-finger (Zif) targetedc methyltransf erases . 

The . above Zif fusion constructs have been made and 
expression of the appropriate fusion proteins has been 
confirmed by immunoblot and methylation analysis. The 
coding regions have jalso been transferred intc^ t£e ^ammalian 
3 5 expression vector pcDNA3;/l/His/^. ? * ' 
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Pigur 16. HIV-1 LTR constructs used to analyz targeted 
methylation. Plasmid pLTR-CAT (top) contains the U3 region 
and 80% of the R region of the HIV-1 proviral long terminal 
repeat (LTR) driving expression of the chloramphenicol 

5 acetyltransf erase (CAT) reporter gene. These elements 
contain the HIV-1 core promoter, enhancer, negative 
regulatory element, and trans-activated region (TAR) . These 
elements also contains 10 CpG dinucleotides (shown as 
circles above the LTR) , which are targets for methylation. 

0 Plasmid pLG-luc (bottom) contains the entire proviral LTR 
and gag leader sequence (GLS) driving expression of the 
firefly lucif erase reporter gene. The 150 bp GLS, which 
lies between the LTR and the first viral gene, contains an 
additional 14 CpG dinucleotides and has been shown to be 

5 important for viral expression. The circles are shaded (see 
box) to indicate the conservation of the individual CpG 
sites in different HIV-1 sequences of class B, the class 
affecting the vast majority of infected individuals in the 
United States and Western Europe. Transcription factor 

0 binding sites are labeled below the LTR and hatched when 
they contain a CpG site. The LTR is flanked by a short 
region of human - genomic DNA present in the original HIV 

Clone. - .-..'v.-. ... ..; ,-1 ; ' ^ i[ 

5 Figure 17* Complete suppression of HIV-1 transcription by 
LTR methylation. Methods: pLTR-CAT was treated with M.SssI 
to methylate all CpG sites. Methylated or unmethylated 
pLTR-CAT was cotransf ected with unmethylated. pLG-luc into 
HLtat cells and lysates were prepared at the times 

0 indicated. To control for the efficiency of transfection 
and recovery, volumes of lysate containing equivalent 
luciferase activity were assayed for CAT activity using the 
FASTCAT- assay kit. Reaction products were quantitated using 
NIH Image- "CAT Activity" indicates conversion of 1- 

5 deoxychloramphenicol to 3 - acetyl -l-^deoxychloramphenicol , 
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normalized to arbitrary units. Control cells were 
transfected with pLG-luc only and harvested 48 hours post- 
transf ection . 
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Detailed Description of the Invention 

The present invention provides for a chimeric protein which 
comprises a mutated DNA methyltransf erase portion and a DNA 
5 binding protein portion that binds sufficiently close to a 
promoter sequence of a target gene, which promoter sequence 
contains a methylation site, to specifically methylate the 
site and inhibit activity of the promoter and thus inhibit 
expression of the target gene . 

10 

As used herein, "chimeric protein" is a protein including at 
least two portions of two proteins positioned adjacent to 
each other so that the coding sequences of both portions are 
in frame and can be translated into one polypeptide. The 
15 chimeric protein may include a portion of a DNA binding 
protein and a portion of a mutated or wild type DNA 
methyltransf erase protein. The two portions of the chimeric 
protein may be linked with intermediate sequences of 8,12,16 
or 20 codons expressed as part of the chimeric protein. 

20 

As used herein, "adjacent" includes two sequences that are 
positioned directly next to one another so that the codons 
of each sequence are in frame and contiguous. "Adjacent" 
also includes two sequences that are nearby one another 
25 linked to intermediate sequences and are in frame and can be 
translated into one polypeptide. 

As used herein "linked to" encompasses a means of joining 
two separate portions of a protein, each of which may have 

3 0 a separate function. The means of joining the functional 
portions of the protein may include a covalent or 
noncovalent association between functional polypeptides via 
an organic substance, an inorganic substance, a nucleotide, 
a polynucleotide, a peptide nucleic acid, a peptide, a 

35 triplex peptide, an electrostatic means, a triplex nucleic 
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acid, or a self -assembling biomaterial . 

As used herein, "DNA methyltransf erase" is a protein which 
is capable of methylating a particular DNA sequence, which 
5 particular DNA sequence may be -CpG- . This protein may be 
a mutated DNA methyltransf erase, a wild type DNA 
methyltransf erase, a naturally occurring DNA 

methyltransf erase, a variant of a naturally occurring DNA 
methyltransf erase, a truncated DNA methyltransf erase , or a 

10 segment of a DNA methyltransf erase which is capable of 
methylating DNA. The DNA methyltransf erase may include 
mammalian DNA methyltransf erase, bacterial DNA 
methyltransf erase, M.SssI DNA methyltransf erase and other 
proteins or polypeptides that have the capability of 

15 methylating DNA. 

As used herein, "mutated DNA methyltransf erase protein 
portion" is a segment of a DNA methyltransf erase protein or 
polypeptide containing an amino acid sequence which is 

20 different from the wild type DNA methyltransf erase amino 
acid sequence. A mutated DNA methyltransf erase protein 
portion - may include a segment of a naturally occurring 
./ variant of the -wild' type DNA ^methyltransf erase protein, a 
segment of a DNA methyltransf erase with a sequence altered 

25 from it he wild .type sequence, a truncated protein or portion 
of protein or polypeptide that is capable of methylating 
DNA. This mutated DNA methyltransf erase may be altered at 
one or more of its amino acids and retains part of the wild 
:type methyltransf erase activity, although the activity of 

30 the mutated DNA methyltransf erase protein portion may be 
attenuated activity. 

As used herein, "DNA binding protein portion" is a segment 
of a DNA binding protein or polypeptide capable of 
35 specifically binding to a particular DNA sequence. / The 
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binding is specific to a particular DNA sequence site . The 
DNA binding protein portion may include a truncated segment 
of a DNA binding protein or a fragment of a DNA binding 
protein. 

5 

As used herein, "binds sufficiently close" means the 
contacting of a DNA molecule by a protein at a position on 
the DNA molecule near enough to a predetermined methylation 
site on the DNA molecule to allow proper functioning of the 
10 protein and allow specific methylation of the predetermined 
methylation site. 

As used herein, "a promoter sequence of a target gene" is at 
least a portion of a non- coding DNA sequence which directs 
15 the expression of the target gene. The portion of the non- 
coding DNA sequence may be in the 5' -prime direction or in 
the 3 ' -prime direction from the coding region of the target 
gene. The portion of the non-coding DNA sequence may be 
located in an intron of the target gene. 

20 

The promoter sequence of the target gene may be a 5' long 
terminal repeat .sequence of a human immunodeficiency virus-1 
proviral DNA; The /target . gene may be 1 a retroviral gene, an 
adenoviral gene, a foamy viral gene, a parvo viral gene, a 
25 .foreign gene expressed in a. cell, an overexpressed gene, or 
a. misexpressed 7 gene . / c- 

As used herein, "specifically methylated means to bond a 
methyl group to a methylation site in a DNA sequence, which 
30 methylation site may be -CpG-, wherein the methylation is 
restricted to particular methylation site(s) and the 
methylation is not random. 

In one embodiment of this invention, the chimeric protein 
35 may be a zinc three-finger. DNA binding, polypeptide linked to 
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a CpG- specific DNA methyltransf erase protein portion. The 
chimeric protein may comprise a mutated Lex A DNA binding 
protein or polypeptide linked to a cytosine 
methyltransf erase protein portion or polypeptide. 

5 

The chimeric protein may include at least a portion of a 
mutated M.SssI DNA methyltransf erase protein or at least a 
portion of a mutated mammalian DNA methyltransf erase . 

10 In one embodiment of this invention, an expression vector 
may code for the chimeric protein described hereinabove . 
The expression vector may be a replicable vector and may 
include a pLS vector, a prokaryotic expression vector, a 
yeast expression vector, a baculovirus expression vector, a 

15 mammalian expression vector, or an episomal mammalian 
expression vector. 

As used herein, "expression vector" is a nucleic acid 
molecule which is capable of having a coding sequence of DNA 
20 or RNA ligated into it and then expressing that sequence 
when introduced into an appropriate host cell. An 
expression vector may be a mammalian expression vector or a 
prokaryotic expression vector. x; : j^o 

25 Another embodiment of this invention is a method ^for 
inhibiting the expression of a target gene which includes 
contacting a promoter sequence of the target gene with the 
chimeric protein described hereinabove, so as... to 
specifically methylate the promoter sequence of the target 

30 gene thus inhibiting expression of the target gene. In this 
embodiment, the target gene may be an endogenous target gene 
which is native to a cell or a foreign target gene. The 
foreign gene may be a retroviral target gene or a viral 
target gene . 
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The target gene in this embodiment may be associated with a 
cancer, a central nervous system disorder, a blood disorder, 
a metabolic disorder, a cardiovascular disorder, an 
autoimmune disorder, or an inflammatory disorder. The 
5 cancer may be acute lymphocytic leukemia, acute myelogenous 
leukemia, B-cell lymphoma, lung cancer, breast cancer, 
ovarian cancer, prostate cancer, lymphoma, Hodgkin's 
disease, malignant melanoma, neuroblastoma, renal cell 
carcinoma or squamous cell carcinoma. The central nervous 

10 system disorder may be Alzheimer's disease, Down's syndrome, 
Parkinson's disease, Huntington's disease, schizophrenia, or 
multiple sclerosis. The infectious disease may be 

cytomegalovirus, herpes simplex virus, human 

immunodeficiency virus, AIDS, papillomavirus, influenza, 

15 Candida albicans, mycobacteria, septic shock, or associated 
with a gram negative bacteria. The blood disorder may be 
anemia, hemoglobinopathies, sickle cell anemia, or 
hemophilia. The cardiovascular disorder may be familial 
hypercholesterolemia , atherosclerosis , or renin/ angiotensin 

20 control disorder. 

The metabolic disorder may be ADA, deficient SCID, diabetes, 
cystic fibrosis, Gaucher' s disease, galactosemia, growth 
hormone deficiency, inherited emphysema, Lesch-Nyhan 
25 disease, liver failure, muscular dystrophy, phenylketonuria, 
or Tay-Sachs disease . The autoimmune disorder may be 
arthritis, psoriasis, HIV, or atopic dermatitis. The 
inflammatory disorder may be acute pancreatitis, irritable 
bowel - syndrome, Chrone's disease or an allergic disorder. 

30 

In one embodiment of the subject invention promoter 
sequences of the target genes listed below may be 
therapeutic target genes of the subject invention: dominant 
oncogenes, c-MYB, c-MYC, c-SIS, ERB A, ERB B, ERB B-l, ERB 
35 B-2, HER-2/NEU, c-SRC, C-YES, c-FPS, c-FES, c-FOS, c-JUN, c- 
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ROS, C-ABL, c - FGR , c-MOS, c-RAS, c-RAF, c-MET, c-ETS, BCL-1, 
BCL-2, ETS , c-FMS, c-FES, C-BLK, TCL-1, TCL-2, TCL-3, TCL-5, 
ALL - 1 /HRX/MLL , PML promoter in PML/RAR-Of fusion, RAR-o? 
promoter in RAR-ar/PML fusion, and NF-1. 

Genes that are overexpressed in cancer cells are also target 
genes of the subject invention. Inhibiting the expression 
of these target genes may reduce tumorigenesis and/or 
metastasis and invasion. Cancer related genes include: 
collagenase 92 Kd Type 4, collagenase 72 Kd Type 4, 
osteopontin, calcyclin, fibroblast growth factor, epidermal 
growth factor, matrilysin and stromolysin. 



Viruses that establish chronic infections and which are 
5 involved in cancer or chronic diseases are also target genes 
of the subject invention. Virus that have possible target 
genes include hepatitis C, hepatitis B, varicella, herpes 
simplex types I and II, Epstein-Barr virus, cytomegalovirus, 
JC virus and BK virus . 

0 

In a preferred embodiment of the subject invention, -the 
promoter sequence of viral protein X and the prqmojter 
sequence of pre-S2/5 of the hepatitis B virus may be target 
genes. .. . • , s - .■ - 

In another embodiment of the subject invention, genes ^whose 
expression is involved in neurodegenerative disease may be 
target genes of the subject invention. These t argent ..genes 
may include beta, amyloid precursor .protein and the prion 

0 protein both of which are shown to be involved in 
Alzheimer's disease. In another embodiment of the subject 
invention, genes which are, involved in the etiology of 
acromegaly may be target genes of the subject invention. 
These target genes may include somatostatin, growth- hormone 

5 releasing hormone and. constitutive G-stimulatory protein 
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mutants . 

The target gene may be in a cell. The cell may be a 
bacterial cell, an animal cell or a plant cell. The cell 
5 may be a eukaryotic cell, a prokaryotic cell, a virus 
packaging cell, a somatic cell, a germ cell, a neuronal 
cell, a myocyte, a T lymphocyte, a CD4 + cell, a tumor cell, 
a CD4+ cell, or a stem cell. 

10 The chimeric protein may be a mutated Lex A DNA binding 
protein portion or polypeptide linked to a cytosine DNA 
methyltransf erase protein portion or polypeptide. 

The DNA methyltransf erase protein portion may be M.SssI DNA 
15 methyltransf erase protein or the mammalian DNA 
methyltransf erase protein. The chimeric protein may include 
a tridactyl zinc finger protein or polypeptide capable of 
specifically binding the HIV-1 5 ' LTR nucleic acid sequence 
linked to a cytosine methyltransf erase polypeptide. The 
20 cytosine DNA methyltransf erase polypeptide may be a M. SasI 
DNA methyltransf erase protein or polypeptide or at least a 
portion of the mammalian DNA methyltransf erase protein. 

The contacting may be by means of liposome mediated 
25 .delivery, retroviral delivery, gene bombardment, 
electroporation, electronic pulse delivery, air-gun 
injection of a nucleic acid-coated pellet, a self -assembling 
nanocrystalline composition, a cytofectin analog, 
condensation of the nucleic acid, receptor mediated gene 
3 0 transfer, glycosylated macromolecular carriers, polar 
(glyco) lipids, (glyco) peptides , synthetic polymers, a 
triplex nucleic acid, naked nucleic acid transfer, particle- 
mediated nucleic acid transfer or cationic precipitation. 

35 A further embodiment of this invention is a - method for 
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inhibiting expression of a target gene in a multicellular 
organism which includes contacting a promoter sequence of 
the target gene with the chimeric protein described 
hereinabove or a nucleic acid molecule encoding the chimeric 
5 protein, so as to specifically methylate the promoter 
sequence of the target gene and thus inhibit expression of 
the target gene in the multicellular organism. 

The multicellular organism may be a plant, an animal or a 
0 human. The plant may be an alfalfa plant, a broccoli plant, 
a rapeseed plant, a carrot plant, a chicory plant, a coffee 
: plant, a cucurbita plant, a euromelon plant, a potato plant, 
a raspberry plant, a sunflower plant, a tomato plant, or a 
wheat plant . 

5 

The administration of the chimeric protein to the 
multicellular organism may comprise intralesional , 
intraperitoneal, intramuscular or intravenous injection; 
liposome -mediated delivery; viral infection; gene 

0 bombardment; topical, nasal, oral, anal, ocular or otic 
delivery. The viral infection may be via a non- integrating, 
replication-defective virus. The virus may comprise^ a 
replication-defective HIV-1 provirus, a retroviral vector, 
an adeno- associated virus; a^N2 -retroviral vector, a SIM 

5 retroviral vector, a LNL6 vector, -a LXSN vector or a MMuLV 
retroviral vector. > 

The process by which a plant or animal as rendered resistant 
to viral infection comprises introducing .into the plant or 

0 animal a construct which on translation gives rise to the 
above-mentioned chimeric protein.^ The introduction of the 
nucleic acid molecule is accomplished by genetic 
transformation of a part of the plant by a DNA sequence 
coding for the nucleic acid molecule, followed by the 

5 regeneration of a transgenic plant. ; The. transformation is 
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carried out by the intermediary of Aarobacterium tumef ari png 
or Aarobacterium rhizogenes . (U.S. Patent Nos . 5,107,065 and 
5,188,958) . 

5 The present invention is further directed to a DNA construct 
for a plant, the construct comprising a genetic sequence and 
a promoter capable of directing expression of the genetic 
sequences wherein the genetic sequence on expression 
provides at least a portion of a sequence specific DNA 

10 binding protein linked to at least a portion of a mutated 
DNA methyltransf erase protein. The DNA construct may 
further be part of a DNA transfer vector suitable for 
transferring the DNA construct into a plant cell and 
insertion into a plant genome . In an embodiment of the 

15 present invention, the DNA construct is carried by broad 
host range plasmid pGA470 which is capable of transformation 
into plant cells using Aarobacterium . The present 

invention, however, extends to other means of transfer such 
as genetic bullets (e.g. DNA-coated tungsten particles, 

20 high- velocity micro pro j ectile bombardment ) and 
electroporation amongst others (Maliga, 1993; Bryant, 1992; 
or Shimamoto, 1989) . . j jfv \ 

The gene encoding the chimeric protein, having- been 
25 introduced into the nonhuman mammal, or an ancestor* of -the 
nonhuman mammal at the single cell stage or an embryonic 
stage, is linked to a promoter and integrated into the 
genome of the nonhuman mammal. One skilled in the art would 
be familiar with the experimental methods necessary, to 
3 0 produce a transgenic mammal, as described in Leder et al . , 
U. S. Patent No. 4,736,866 and Krimpenfort and Berns, U. S. 
Patent No. 5,175,384 and Wagner and Chen, U. S. Patent No. 
5,175,385. 

35 A further embodiment of the subject invention is a method. of 
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treating a subject infected with a virus which comprises 
administering to the subject a therapeutic composition 
comprising a chimeric protein, or a nucleic acid molecule 
encoding the chimeric protein, capable of sequence specific 
5 methylation of a site(s) in a promoter sequence of a target 
viral gene under suitable conditions so as to specifically 
methylate the promoter sequence of the target gene thereby 
inhibiting expression of the target gene and thus treating 
the subject infected with the virus. Examples of viruses 
10 are listed herein. 

The therapeutic composition may comprise a nucleic acid 
encoding at least a portion of a mutated Lex A DNA binding 
protein or polypeptide linked to at least a portion of a 

15 mutated DNA methyltransf erase protein or polypeptide. The 
therapeutic composition may be a nucleic acid encoding a 
tridactyl zinc finger DNA binding protein or polypeptide 
capable of specifically binding the HIV-1 5' LTR nucleic 
acid sequence linked to at least a portion of a mutated DNA 

2 0 methyltransf erase protein or polypeptide. The therapeutic 
composition may comprise a nucleic acid encoding at least a 
portion . <of a sequence, .specific DNA binding protein: or 
* epolypep tide linked ;^to :a .mutated DNA methyltransf erase 
^protein or polypeptide . ; j, _ , 

25- ^ ; r ,. . . : j ; ; : . 

;The subjects receiving the therapeutic composition of this 
invention may be a human or a non-human mammal. 

The therapeutic composition, may comprise a replicable 
30 expression vector, chosen from the group consisting of a pLS 

vector, a prokaryotic expression vector, a yeast expression 
.vector, a haculovirus expression vector, a mammalian 

expression vector, and an ep.isomal mammalian expression 

vector. The administration may comprise intralesional , 
35 , : intraperitoneal ; intramuscular or intravenous injection; 
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liposome -mediated delivery; viral infection; gene 
bombardment; topical, nasal, oral, anal, ocular or otic 
delivery. The viral infection may be via a non- integrating, 
replication-defective virus. 

5 

In one preferred embodiment of the method above the nucleic 
acid molecule encoding a chimeric protein is incorporated 
into a liposome to allow for administration to the subject. 
Methods of incorporation of nucleic acid molecules into 

10 liposomes are well known to those of ordinary skill in the 
art. In another embodiment of this method, the nucleic acid 
encoding the chimeric protein may be delivered via 
transf ection, injection, or viral infection. There are 
several protocols for human gene therapy which have been 

15 approved for use by the Recombinant DNA Advisory Committee 
(RAC) which conform to a general protocol of target cell 
infection and administration of transf ected cells (see for 
example, Blaese, R.M. , et al., 1990; Anderson, W. F. , 1992; 
Culver, K.W. et al . , 1991). In addition, U.S. Patent No. 

20 5,399,346 (Anderson, W. F. et al . , issued March 21, 1995) 
describes procedures for retroviral gene transfer. The 
contents of these support references are incorporated in 
their entirety into the subject application. Retroviral- 
mediated gene transfer requires target cells which are 

25 undergoing cell division in order to achieve stable • 
integration hence, cells are collected from a subject often 
by removing blood or bone marrow. • 1 

Several methods have been developed over the last decade- for 
3 0 the transduction of genes into mammalian cells for potential 
use in gene therapy. In addition to direct use of plasmid 
DNA to transfer genes, episomal vectors, retroviruses, 
adenoviruses, parvoviruses, and herpesviruses have been used 
(Anderson et al . , 1995; Mulligan; 1993; The contents of 
35 which are incorporated in their entirety -into the subject 
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application) . For transfer of genes into cells ex vivo and 
subsequent reintroduction into a host, as would be most 
feasible in immunodeficiency patients, retroviruses have 
been the vectors of choice . 

5 

A separately preferred embodiment of the invention is a 
replicable expression vector encoding a chimeric protein 
capable of sequence specific DNA methylation. 

10 The replicable expression vector may be chosen from the 
group consisting of a pLS vector, a prokaryotic expression 
. vector, a yeast expression vector, a baculovirus expression 
vector, a mammalian expression vector, and an episomal 
mammalian expression vector. The agent may comprise a 

15 mutated Lex A binding polypeptide linked to a 
methyltransf erase polypeptide. The agent may comprise a 
tridactyl zinc finger polypeptide that specifically binds to 
the HIV-1 5'LTR nucleic acid sequence linked to a 
methyltransf erase polypeptide. 

20 

Another embodiment of the invention is a pharmaceutical 
composition comprising a therapeutically effective amount of 
the/ replicable vector described hereinabove and . : a 
pharmaceutical^ acceptable carrier. The carrier, may 
25 comprise a diluent. The pharmaceutical^ acceptable carrier 
may be an aerosol, intravenous, oral or topical .carrier and 
is further described herein. 

Another embodiment of the subject invention is a method of 
30 obtaining an expression vector including. DNA encoding a 
desired chimeric protein that includes at least a portion of 
a mutated DNA methyltransf erase protein and at least a 
portion of a DNA binding protein such chimeric protein being 
capable of inhibiting the expression of a target gene which 
35 comprises: (a) obtaining a population of expression vectors 
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each of which expresses a chimeric protein including (l) a 
DNA corresponding to an endogenous promoter sequence of the 
target gene, which promoter contains at least one sequence 
specific methylation site; (2) a DNA encoding a mutated DNA 
5 me thyltransf erase ; and (3) a DNA encoding a DNA binding 
protein, position adjacent the DNA of (2) , DNA (2) and (1) 
being so positioned in the expression vectors as to permit 
expression of the chimeric proteins; (b) introducing the 
population of vectors from step (a) into an appropriate host 

10 under conditions such that the chimeric proteins are 
expressed and methylate DNA; ( c) isolating the population 
of vectors from step (b) from the host; (d) treating the 
population of vectors from step ( c) with a suitable 
restriction endonuclease so as to digest the DNA at a 

15 specific site in the promoter if the site is not methylated; 
(e) introducing the population from step (d) into an 
appropriate host, which hosts have the property that they 
degrade DNA which has been non- specif ically methylated at 
sites other than the sequence specific methylation site 

2 0 contained in the promoter; and (f ) culturing the hosts from 
step (e) under conditions such that vectors which have not 
been degraded express either a mutated or a non-mutated form 
of the chimeric protein, thus obtaining at least, one nucleic 
acid encoding a mutated form of a chimeric DNA 

25 methyltransferase/DNA" binding protein capable of inhibiting 
the expression" of the target gene. 

As used herein, "desired" includes optimal properties of a 
chimeric protein as described hereinabove. The properties 
30 include specifically methylating a DNA . sequence in a 
promoter sequence of a target gene and inhibiting the 
activity of the promoter and thus inhibiting expression of 
the target gene, 

35 In the practice of any of the methods of the invention or 
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preparation of any of the pharmaceutical compositions an 
"effective amount" is an amount which is effective to 
produce the sequence specific methyl transferase and 
inactivation of the target gene. Accordingly, the effective 
5 amount will vary with the subject being treated, as well as 
the condition to be treated. For the purposes of this 
invention/ the methods of administration are to include, but 
are not limited to, administration cutaneously, 
subcutaneously , intravenously, parenterally , orally, 
10 topically, or by aerosol . 

'. As used herein, the term "suitable pharmaceutically 
acceptable carrier" encompasses any of the standard 
pharmaceutically accepted carriers, such as phosphate 

15 buffered saline solution, water, emulsions such as an 
oil/water emulsion or a triglyceride emulsion, various types 
of wetting agents, tablets, coated tablets and capsules. An 
example of an acceptable triglyceride emulsion useful in 
intravenous and intraperitoneal administration of the 

20 compounds is the triglyceride emulsion commercially known as 
Intralipid®. 

Typically such carriers contain excipients such as starch, 
milk, sugar, certain types of clay, gelatin, stearic acid, 
25 talc, vegetable fats or oils, gums, glycols, or other ..known 
excipients . Such carriers may also include flavor and color 
additives or other ingredients. 

Also comprehended by the invention are pharmaceutical 
3 0 compositions comprising therapeutically effective amounts of 
polypeptide products of the invention together with suitable 
diluents, preservatives, solubilizers , emulsifiers, 
adjuvants and/or carriers. A "therapeutically effective 
amount" as used herein refers to that amount which provides 
35 a therapeutic effect for a given condition . and 



WO 97/11972 



PCT/US96/15576 



-26- 

administration regimen. Such compositions are liquids or 
lyophilized or otherwise dried f ormulations and include 
diluents of various buffer content (e.g., Tris-HCl., 
acetate, phosphate) , pH and ionic strength, additives such 
5 as albumin or gelatin to prevent absorption to surfaces, 
detergents (e.g., Tween 20, Tween 80, Pluronic F68, bile 
acid salts). solubilizing agents (e.g., glycerol, 
polyethylene glycerol), anti-oxidants (e.g., ascorbic acid, 
sodium metabisulf ite) , preservatives (e.g., Thimerosal, 

10 benzyl alcohol, parabens) , bulking substances or tonicity 
modifiers (e.g., lactose, mannitol) , covalent attachment of 
. polymers such as polyethylene glycol to the protein, 
complexation with metal ions, or incorporation of the 
material into or onto particulate preparations of polymeric 

15 compounds such as polylactic acid, polglycolic acid, 
hydrogels, etc, or onto liposomes, microemulsions , micelles, 
unilamellar or multilamellar vesicles, erythrocyte ghosts, 
or spheroplasts . Such compositions will influence the 
physical state, solubility, stability, rate of in vivo 

20 release, and rate of in vivo clearance. The choice of 
compositions will depend on the physical and chemical 
properties of the protein. For example, a product derived 
,:from a membrane -bound .-form of, a protein may require a 
formulation containing detergent . Controlled or sustained 

25 ^release compositions include formulation in lipophilic 
cdepots (e.g., fatty acids, waxes, oils). Also comprehended 
by the invention are particulate compositions coated with 
polymers (e.g., poloxamers or poloxamines) and coupled to 
antibodies directed against tissue -specific receptors, 

30 ligands or antigens or coupled to ligands of tissue-specific 
receptors. Other embodiments of the compositions of the 
invention incorporate particulate forms protective coatings, 
protease inhibitors or permeation enhancers for various 
■ routes of administration, including parenteral, pulmonary, 

3 5 * nasal and oral. 
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Another embodiment of the subject invention is a transgenic 
non- human mammal whose somatic and germ cells contain and 
express a gene coding for a desired chimeric protein which 
is capable of inhibiting the expression of a target gene, 
5 the gene having been stably introduced into the non-human 
mammal at the single cell stage or an embryonic stage, and 
wherein the gene is linked to a promoter and integrated into 
the genome of the non-human mammal. The desired chimeric 
protein may be the desired chimeric protein described 
10 hereinabove. One skilled in the art would be familiar with 
the experimental methods necessary to produce a transgenic 
mammal/ as described in Leder et al . , U. S. Patent No. 
4,736,866 and Krimpenfort and Berns, U. S. Patent No. 
5,175,384 and Wagner and Chen, U. S. Patent No. 5,175,385. 

15 

If it were possible to stimulate de novo methylation of HIV 
proviral DNA, then clonal inheritance of methylation 
patterns would render the inactivation essentially 
irreversible. As described herein, this irreversible 

20 inactivation is the basis of a therapeutic approach that has 
practical advantages over alternative molecular biological 
approaches (including antisense RNA, ribozymes, targeted 
nucleases, and intracellular anti -HIV antibodies p reviewed 
by Volberding, 1995). Provirus silencing by targeted 

25 cytosine methylation stimulates an existing -host defense 
system and as an analogy, has the virtues of a vaccine : ; the 
clonal inheritance of methylation patterns is like the 
- immunological memory, . and the strong suppression of 
transcription is like the immune defense system in that both 

30 perturb the function of particular gene products. 

It is important to point out that the therapeutic agent that 
methylates the target sequence need be present only 
transiently; the new methylation pattern would be propagated 
35 indefinitely by the cellular DNA methylating system <(Bestor, 
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1990; Bestor, 1992; Bestor and Coxon, 1993; Bestor and 
Verdine, 1994) . The alternate molecular biological 

approaches to medical therapies mentioned above, require 
indefinite expression of the agent . This has proven to be 

5 difficult in practice (Challita and Kohn, 1994) and may be 
toxic or may provoke a harmful immune response . An 
additional complication of long-term expression is 
insertional mutagenesis, while inactivation by targeted 
methylation as described herein may involve delivery of the 

0 agent by a nonintegrating, replication-defective virus that 
establishes an abortive infection. None of the alternative 
molecular biological therapies discussed above has these 
advantages . 

5 The 5' LTR of HIV-1 may be inactivated by targeted cytosine 
methylation. As shown in Figure 1 below, the LTR that 
directs transcription of the provirus is rich in binding 
sites for transcription stimulatory proteins (Nabel, 1993; 
Garcia and Gaynor, 1994) and rich in CpG sites, the 

0 predominant site of methylation in vertebrates. It is also 
well-established that methylation of only a few of the CpG 
sites in the 5' LTR represses transcription to nearly 
undetectable levels (Bednarik et al., 1990; Joel et al . , 
1993, reviewed by Bednarik, 1992), but the specific, 

5 ^critical sites that mediate this effect have not been 
^identified. - 

Mammalian promoters are silenced by cytosine -5 methylation 
at 5'-CpG-3' dinucleotides . The promoter within the 5' LTR 
0 of HIV-1 may be silenced by directing enzymatic methylation 
to critical CpG sites. Because methylation patterns are 
subject to clonal inheritance, the silencing is essentially 
irreversible. 

5 The subject invention encompasses but is not limited to the 
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following additional embodiments. 

1. Methylation of a predetermined CpG site. In one 
embodiment of the subject invention a sequence -specific DNA 

5 binding protein may be fused to a DNA cytosine 
methyltransf erase, and mutant fusion proteins whose 
methyltransf erase activity is dependent on the DNA binding 
protein may be selected from combinatorial libraries. DNA 
binding protein/DNA methyltransf erase chimeras that 

10 methylate only those CpG sites immediately adjacent to the 
binding site may be obtained by means of a novel cyclic in 
vivo/in vitro selection protocol. An attenuated version of 
the CpG-specific DNA methyltransf erase M.SssI may be 
constructed in this way. In vitro selection requires 

15 resistance to methylat ion- sensitive restriction endonuclease 
cleavage at the target site, and in vivo selection removes 
plasmid molecules methylated at non-specific sites; this 
second step involves growth in AfcrBC^zStrains of E. coli, 
which degrade heavily methylated DNA. 

20 

2. Identification of those CpG sites in the HIV-1 5' LTR 
whose methylation produces maximal repression of transcrip- 
tion. The documented inhibitory effects of methylation on 
HIV-1 transcription are likely to be exerted through^, a 

25 subset of the 11 CpG sites in the 5' LTR of HIV-1. -In a 
further embodiment of the subject invention/ these critical 
CpG sites may be identified by transcription assays with 
test constructs that bear singly 'methylated CpG sites; each 
of the 11 CpG sites can be tested individually. -The 

30 information obtained may be important for the selection of 
target sites in Examples 3-5. 

3. Design, selection, and affinity maturation of zinc 
finger-DNA methyltransf erase chimeras that methylate 

35 critical CpG sites in the HIV 5' LTR. In another embodiment 
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of the subject invention, zinc finger modules may be 
catenated in novel combinations to yield proteins that bind 
with high affinity and specificity to predetermined 
sequences. Zinc finger proteins that bind to sites adjacent 
5 to critical CpG sites (identified in Example 2) may be 
selected from combinatorial expression libraries and fused 
to an attenuated CpG-specific DNA methyltransf erase 
(developed in Example 1) . Further rounds of cyclic in 
vivo/in vitro selection (as developed in Example 1) may be 
0 used to obtain variants that methylate only CpG sites 
adjacent to the binding site of the zinc finger moiety. 

4. Inhibition of HIV-1 5' LTR -dependent transcription in 
cultured human cells that express zinc finger-DNA methyl- 
transf erase fusion proteins of novel and predetermined 
specificity. Example 3 yields constructs that methylate 
critical sites in the HIV-1 5' LTR. In a further embodiment 
of the subject invention, these constructs may be introduced 
into cultured human cells that express a reporter gene 
driven by the HIV-1 5' LTR as a means of testing the 
inhibitory effects of targeted de novo methylation. 

5. XnhiJbition of HIV-1 replication in human T lymphocytes 
productively infected with HIV-1. In another embodiment of 
the subject invention, constructs that prove effective in 
.the studies of Example 4 may be introduced into HIV-1 
producing human T lymphocytes, and methylation of the target 
sequence and its effect on virus production can be measured 
directly. The methylating construct may be delivered on a 
plasmid vector or as part of a recombinant adeno- associated 
virus (AAV) . These embodiments suggest a high level of 
usefulness of targeted methylation in the control of HIV-1 
proliferation, and the development of the methylating 
construct into a therapeutic agent. 
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Direction of a DNA methyltransf erase to any CpG site within 
HIV-1 regulatory sequences, may result in the silencing of 
the provirus dependent upon the specific zinc finger 
protein/DNA methyltransf erase chimera. Zinc finger motifs 
5 have been shown to be remarkably versatile DNA binding 
structures (Choo and Klug, 1994) ; they are independently- 
folded protein modules that recognize the edges of 3 base 
pairs in the major groove . Over 1,300 zinc fingers have 
been identified, and the binding specificity of a large 

10 number has been determined (Choo et al . , 1995) . Several 
laboratories have succeeded in assembling zinc finger 
modules in novel combinations; these designed proteins have 
been subjected to refinement by phage-display selection> and 
variants that bind to novel predetermined sequences with 

15 high specificity and nanomolar affinity have been obtained 
by several laboratories (Choo et al., 1994; Wu et al, 1995; 
Desjarlais and Berg, 1994; Rebar and Pabo, 1994) . These 
small proteins contain tandem arrays of three zinc finger 
modules and bind to 9 bp recognition sequences. Klug and 

20 colleagues were able to show that a zinc finger that was 
targeted to the BCR-ABL breakpoint sequence could bind to 
and suppress transcription of the BCR-ABL gene in cultured 
- .human aleukemic cells (Choo -et . al . >: s 19 94 ) . A zinc : f inger 
, protein designed, to bind -to a sequence adjacent to a CpG 

25 site -in the HIV.-l 5' I/TR can be -fused .to * an attenuated ? CpG- 
specific DNA methyltransf erase, thereby achieving efficient, 
selective - methylation of r that CpG site- A DNA 

methyltransf erase with the desired binding properties has 
been cloned from a Spiroplasma species; this enzyme, known 

30 - as M.SssI, has the same sequence specificity as the 
mammalian DNA methyltransf erase (CpG) (as cloned by Bestor; 
Genbank Accession No. X14805) , but a higher turnover number 
and no requirement for hemimethylated substrates (Renbaum 
and Razin, 1992). . The strategy is shown in Figure 3. 
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One embodiment of this invention is the development and 
therapeutic application of zinc finger protein/DNA 
methyltransf erase chimeras or agents whose general 
organization is similar to that of mammalian DNA 
5 methyltransf erase . These agents selectively methylate and 
inactivate the 5' LTR of HIV-1. Such agents have 
considerable promise as therapeutic agents for retroviral 
diseases, and for other applications where selective 
inactivation of a promoter sequence is desired. 

10 

The zinc finger protein/DNA methyltransf erase chimera of 
this invention is similar in design to mammalian DNA 
methyltransf erase, which has been purified (Bestor and 
Ingram, 1983; Bestor and Ingram, 1985a), characterized in 

15 terms of biochemical activities (Bestor and Ingram, 1985; 
Bestor, 1987) , the cDNA that encodes the enzyme cloned 
(Bestor, 1988; Genbank Accession No. X14805; Bestor et al . , 
1988) , and the protein demonstrated to be essential for 
mouse development in a gene disruption study (Li et al., 

20 1992; Li et al . , 1993). As shown below, mammalian DNA 
methyltransf erase has an N- terminal regulatory domain and a 
C-terminal catalytic domain that closely resembles bacterial 
restriction methyltransf erases . For a recent review, of ^the 
structure and function of mammalian DNA methyltransf erase, 

25 see Bestor and Verdine (1994) . ; The agents which are : ; . c 

embodiments of this invention differ from mammalian DNA 
methyltransf erase in that the regulatory domain, targets the 
catalytic domain to predetermined sequences: The regulatory 

3 0 region of mammalian DNA methyltransf erase suppresses the 
methylation of previously linmethylated sites (Bestor, 1992) , 
and directs the protein to sites of new DNA synthesis in S 
(synthesis) phase nuclei (Leonhardt et al . , 1992). The 
targeted methylation which is an embodiment of this 

3 5 invention is grounded in the knowledge of the basic biology 
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of the system that establishes and maintains methylation 
patterns in the mammalian genome. 

The expression construct pCAL7 directs the expression of the 
5 DNA methyltransf erase M.SssI (Renbaum and Razin, 1992). 
This enzyme in present in Spiroplasma species and has the 
same sequence specificity as mammalian DNA methyltransf erase 
(CpG) , but a higher turnover number and no requirement for 
hemimethylated substrates (Renbaum and Razin, 1992) . The C- 

10 terminal domain of mammalian DNA methyltransf erase also has 
CpG-specific DNA methyltransf erase activity and may also be 
used in practicing this invention/ although k cat is much 
lower than that of M.SssI (Bestor, 1992; Renbaum and Razin, 
1992) . The plasmid construct described in Example 1 may be 

15 confirmed by restriction analysis and is described in full 
in the following section and depicted in Figure 5. This 
construct embodies (in a simple form) most of the principles 
of the targeted methylation approach described herein. 
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Table 1 shown below, offers data which validates the 
selection against indiscriminate methylation via McrBC 
restriction of methylated DNA (Table 1) . 



E. coli strain Input plasmid CFU /xg DNA* 1 

(Mcr genotype) (methylation status) 



ER1381 (McrBC*) 


pSE4 (— ) 


1.1 X 10 9 


ER1793 (McrBC") 


pSE4 (— ) 


1.7 x 10 9 


ER1381 (McrBC*) 


pCAL7 ( + ) 


6 x 10 s 


ER1793 (McrBC - ) 


pCAL7 (+) 


8 x 10 8 



Table 1» Selection against nonspecific methylation in 
McjcBC+ K. coli. pSE4 is an unmethylated control plasmid; 
pCAL7 directs synthesis of M.SssI (Renbaum et al . , 1992) 
under the control of the p tac promoter. Plasmids were 
introduced into the indicated bacterial strains (Raleigh, 
1992; Sutherland et al . , 1992) by electroporation, and 
transformation efficiencies determined by colony counts on 
selective media. The values shown are conservative, as IPTG 
induction was not used in this case, and the plasmid was 
only partially methylated as a result of basal expression 
from the p tac promoter; measurements of Hpall resistance 
indicated <50% methylation, IPTG treatment increased the 
methylation level and increased restriction from a factor of 
~10 3 to 10 A . 

Promoter inactivation via targeted cytosine methylation* 

A lambda repressor- integrase fusion protein has been shown 
to direct HIV-1 integration to sites in the vicinity of 
lambda operators (Bushman, 1994) . Through an analogous 
mechanism, a chimeric DNA binding protein/DNA 
methyltransf erase will cause preferential methylation of 
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sites in the vicinity of the recognition sequence of the DNA 
binding protein through a very large increase in the local 
concentration of DNA methyltransf erase . As shown in Figure 
3 , this may be used to selectively methylate and silence 
5 predetermined promoter sequences . 

Crystallography data have shown that DNA cytosine 
methyltransf erases make base-specific contacts in the major 
groove, and numerous sequence -independent contacts with the 
0 DNA backbone (Bestor and Verdine, 1994) . A diagram of these 
contacts is shown in Figure 5. 

Mutations of residues involved in the backbone contacts 
lowers the affinity of the enzyme for DNA and reduces the 

5 rate of methyl transfer. However, tethering the weakened 
DNA methyltransf erase to a sequence -specific DNA binding 
protein greatly increases the concentration of the DNA 
methyltransf erase in the vicinity of the binding site, thus 
offsetting the effects of reduced affinity. This results in 

0 a chimeric protein that methylates sites only in the immedi- 
ate vicinity of the recognition sequence of the DNA binding 
protein. Toxicity due to methylation of collateral CpG 
- sites is obviated. Rather than estimating the - nature ■ of 
mutations that would yield the desired result, proteins with 

5 the desired properties may be selected from large 
combinatorial libraries. The design and selection of such 
chimeric proteins, and their development into potential 
therapeutic agents, is an embodiment of the subject 
invention. The 5' LTR of HIV-1 may be one target -of 

0 inactivation, and a rational, step-wise approach to the this 
is described below in the Experimental Details. 

Gene Inactivation by Targeted Methylation 

5 1. Additional embodiments of methylating agents. 
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As mentioned above, the zinc finger/DNA methyltransf erase 
chimeras discussed and described herein are analogous in 
structure to mammalian DNA methyltransf erase, the enzyme 
that normally establishes and maintains methylation patterns 
5 in the genome. This enzyme has a zinc binding site that has 
been shown to participate in the discrimination of 
unmethylated and hemimethylated CpG sites (Bestor, 1992). 
Replacement of this zinc-binding region with a tridactyl 
finger region of known binding specificity confers a new de 

10 novo specificity on the enzyme. Such an enzyme may be less 
immunogenic, and the efficiency of methylation may be 
increased because the coupling of DNA replication and 
methylation would be restored (Leonhardt et al., 1992) . New 
generations of increasingly effective methylating agents may 

15 be developed using this approach. 

2 . Delivery of the methylating agent 

Described herein is a new therapeutic agent. Many delivery 
agents are described and encompass embodiments of this 

20 invention which are technically sufficient to allow delivery 
to the target cell population. The AAV transducing vector 
used in the experiments of Example 5 has many advantages, 
although mother transducing vectors with superior properties 
are also encompassed in this invention. Delivery of the 

25 methylating agent to a target cell population has no addi- 
tional difficulties over those associated with other molecu- 
lar biological therapeutics . 

As pointed out earlier, in favorable cases the methylating 
3 0 agent need be present only transiently,, since the new 
methylation pattern is subject to clonal inheritance, and 
gene silencing mediated by this agent is essentially irre- 
versible. This advantage is not enjoyed by alternative 
molecular biological therapeutics. In cases of recurrent 
35 infection or relapse due to silencing of an inadequate 
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fraction of the proviral load, further treatments with the 
original agent or an agent that recognizes a different 
critical CpG site may be undertaken. The highly versatile 
nature of the methylating agents makes such an approach more 
5 favorable than the extant molecular biological therapeutic 
agents . 

3* Prophylaxis in humans and animals 

Targeted methylation may also be applied to prophylaxis and 

10 the prevention of recurrent infections. One promising route 
involves a zinc finger/DNA methyltransf erase chimera that 
resides in the cytoplasm by virtue of the lack of nuclear 
localization signals, and which methylates incoming viral 
DNA during its transit through the cytoplasm. Previous work 

15 has demonstrated that retroviral DNA is accessible to 
nucleases (and therefore to methyltransf erases of similar 
size) while in the cytoplasm (Bowerman et al., 1989), and 
methylation prior to integration would yield a provirus that 
is transcriptionally silenced from the time of integration. 

20 Long-term, stable expression of such a construct could 
protect recurring infections with HIV-1 or other susceptible 
viruses. Successful development of constructs that can 
selectively silence specific viral promoters justifies the 
development of prophylactic derivatives for medical and 

25 veterinary applications and are also included as an 
embodiment of the instant invention. 



4 . Applications of targeted methylation to other gene 
inactivation studies 

3 0 The methylating agents described here are. very versatile, 
and can be targeted to other promoters by installation of 
zinc finger moieties of the appropriate sequence 
specificity. The choice of HIV-1 as a target in this 
proposal was motivated by the severe threat to public health 

35 posed by this virus rather than by any special 
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vulnerability . Slight modification of the design and 
selection procedures can produce agents that me thy late any 
predetermined sequence, and such agents may be useful in any 
case in medicine or experimental biology where it is 
5 desirable to silence a given promoter. 

This invention is illustrated in the Experimental Detail 
section which follows. These sections are set forth to aid 
in an understanding of the invention but are not intended 
10 to, and should not be construed to, limit in any way the 
invention as set forth in the claims which follow 
thereafter. 
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Experimental Details 

In the construction of a nucleic acid vector (see Figure 6) , 
LexA was chosen for its proven ability to direct fused 
5 proteins to LexA binding sites (Brent and Ptashne, 1986) , 
and M.SssI was chosen because of its high specific activity 
and the fact that it is the only bacterial methyltransf erase 
that has the same specificity (5'-CpG-3'; Renbaum and 
Razin, 1992) as the mammalian enzyme (Bestor and Ingram, 
10 1983) . 

Example 1. Targeted de novo methylation. 

The product of the construct represented in Figure 6 yields 
an increased local concentration of DNA methyltransf erase in 
15 the vicinity of LexA binding sites. As described above, 
this alone does not ensure targeted methylation, as the 
M.Sssl moiety retains intrinsic activity towards all CpG 
sites and substantial methylation of collateral sites is to 
be expected; such indiscriminate methylation is lethal to 

2 0 mammalian cells. It is therefore necessary to make the DNA 

methyltransf erase moiety dependent on LexA-mediated DNA 
binding; this may be accomplished by selection of mutant 
versions of M.SssI that have reduced -intrinsic DNA binding 
activity- A novel . cyclic in vivo/in vitro selection 
25 protocol is used to select mutant proteins of the desired 
character. -Resistance to cleavage by a methylation - 
sensitive restriction endonuclease provides in vitro 
selection for methylation of the target site; growth of 
cleavage-resistant pools of plasmid in McrBC* strains of E. 

3 0 coli selects against chimeras that methylate non- target 

sites. (Mcr was named from modified cytosine restriction; 
the system causes the degradation of plasmid DNA that is 
methylated at many positions but has no effect on DNA that 
is methylated at single sites ; Raleigh, - 1992) . The selection 
35 scheme is 'depicted . in Figure, 7 . : ~ 
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As mentioned above, the initial construct methylates both 
specific (that is, the Smal target site) and non-specific 
CpG sites elsewhere on the plasmid. It is therefore 
necessary to transfer DNA binding authority from the 
5 catalytic moiety to the DNA binding moiety; this is done by 
selecting for mutations that prevent the methylation of non- 
specific sites while allowing methylation of the specific 
site . 

10 It cannot be predicted as to which mutations might give the 
desired reduction in affinity for DNA, so random mutations 
are introduced and selection is applied to obtain mutants of 
the desired character. The M.SssI moiety is mutagenized by 
-f^— Mn ++ PCR (PCR amplification in the presence of ^l^&^oauses 

15 mis incorporation of nucleotides and therefore transition 
and transversion mutations) from primers just external to 
the BamHI and AscI sites (the primers are depicted as arrows 
in Figure 8) . The PCR product is cleaved with BamHI and 
AscI, the fragment purified by gel electrophoresis and 

20 ligated between the AscI and BamHI sites of pLS (Figure 6) . 
The ligation products are electroporated into JM105 
(McrBC) . Calculations indicate that optimal ligation, and 
electroporation conditions allow selection to be applied/ to 
>10 9 separate clones per screen (J. Hill and T. Bestor, data 

25 not shown) . After 20 min growth in rich liquid medium at 
37°C and 150 min in rich medium supplemented with 50 /xg ml" 1 
ampicillin, the cells are washed 3 times and plasmid DNA 
extracted by the alkaline lysis procedure. The DNA is 
treated with 5 units fig' 1 Smal for 2 hr at 25°C, then 

30 transformed into McrBC+ E. coli strain ER1381. As 
previously described, the McrBC system degrades DNA that is 
methylated at many sites but does not affect DNA methylated 
at single sites (Raleigh, 1992). It may be necessary . to 
perform more than one iteration of the mutagenesis/selection 

35 procedure, but the process is not demanding and each it era- 
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tion of the complete cyclic in vivo/ in vitro selection 
method can be completed in a few days . 

An additional selection step may be added to the cyclic 
5 selection procedure if the previous two steps give insuffi- 
cient enrichment for the clones of interest . Plasmid pools 
are extracted from McrBCz bacteria and cleaved with Smal 
together with one of the 6 methylation- sensitive enzymes 
that have unique sites in pLS . If both sites are 

10 methylated, no cleavage occurs. If both sites are 
unmethylated, both will be cleaved- Only if one site is 
methylated and the other unmethylated will unit -length 
molecules with compatible ends be produced. These unit- 
length linear molecules are isolated by agarose gel 

15 electrophoresis, circularized by treatment with DNA ligase, 
and reintroduced into E. coli by electroporation. Note that 
this selection step, as in the case of the other two, can be 
completed in less than two days. 

20 The procedure described above is designed to produce a 
chimeric protein that methylates a predetermined target site 
adjacent to the binding site of a sequence-specific DNA 
binding- eprotein. - .' ^Candidate clones may be confirmed, by 
■ simple assays. 1 To- confirm that Smal resistance is due to 

25 - ^methylation of the j target site and not to loss of the .site 
by mutation, candidate plasmids are tested for resistance to 
Smal and sensitivity to the methylation- insensitive 
' : isoschizomer. ;XmaI (both, enzymes, recognize the sequence 
CCCGGG)\ ^Constructs that pass this test are examined for 

30 non-specific methylation by testing for sensitivity to the 
. methylatibn-sensitive endonucleases BstUl (CGCG) , Hpall 
(CCGG) , Maell (ACGT) Hhal (GCGC) . Resistance and sensitiv- 
ity is assessed from banding patterns after agarose gel 
electrophoresis. It ^can be concluded that targeted 

35. methylation was ^achieved upon recovery of plasmids that are 
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resistant to Smal and sensitive to Xmal, BstUl, Hhal, Hpall, 
and Maell . 

If clones of the desired binding and enzymatic specificity 
5 do not emerge from the screen, a likely reason may be 
stearic incompatibility of the DNA binding protein and DNA 
methyltransf erase in the orientation shown in Figure 6 . New 
constructs may be made in which the order of the moieties 
are reversed, and the selection screen described above may 
10 be done on these new constructs. 

The number of iterations of the mutagenesis/selection 
protocol that is required to produce constructs with the 
required characteristics may be variable. However, the 
15 procedure is rapid, and multiple iterations are not 
demanding . 

Constructs that have the desired binding and enzymatic 
specificity are sequenced in order to identify mutations 

20 that make DNA methyltransf erase activity dependent on the 
binding energy of the DNA binding fusion partner. The 
nature and position of the mutations are referred to the 3D 
structure of cytosine r methyltransf erases oM.Hhal 
(Klimasauskas et al . , 1994) and M.Haelll .(Reinisch et al., 

25 - 1995) ; both of these enzymes are closely related to .each ; 
other and to M.SssI, and conservation of tertiary structure 
is very likely. The desired mutations may map to one of the 
many residues that make nonspecific contacts with the DNA 
backbone (reviewed by Bestor arid Verdine, . 1994) . These 

30 mutations are introduced into the starting constructs of 
Example 3 so as to reduce the amount of selection to obtain 
constructs of the desired character. - 



35 



Example 2\ Identification of those CpG sites in the HIV-1 5' 
LTR whose methylation produces maximal v repression of 
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transcription. As described previously, viral transcription 
is strongly repressed by methylation of the HIV-l 5' LTR 
(reviewed by Bednarik, 1992) - It is very likely that this 
effect is exerted through a subset of the 11 CpG sites 
5 present in the 5' LTR of HIV-l (see Figure 1) . The critical 
CpG sites are identified by measurements of transcription 
rates of test constructs that bear single methylated CpG 
sites* The CpG site that gives the highest degree of 
inhibition when methylated serves as the target for Examples 
10 3 and 4 . 



The approach is related to methods established by 
Busslinger et al . (1983) . The Pstl-BamHI insert of pUC- 
BENN-CAT (which contains the HIV-l 5' LTR driving 

15 transcription of the bacterial chloramphenicol 
acetyltransf erase (CAT) gene) (Gendelman et al . , 1986) is 
excised and cloned into M13 mpl8. Single -stranded phage DNA 
is prepared, and synthetic oligonucleotide primers that bear 
m 5 C at single specific CpG sites are hybridized to an M13 

20 clone of the sequence of interest- The primer is extended 
with a DNA polymerase; sequenase is the polymerase of choice 
for this invention because of its high processivity and low 
yield of partially extended products. The nick at the 5' 
,. end of the pr.imer is sealed by incubation with E. call DNA 

25 ligase and the circular double stranded molecules are 
purified away from single stranded template and 
oligonucleotide primer by agarose gel electrophoresis. The 
circular DNA is purified .from gel slices, cleared of 
ethidium bromide by extraction with phenol, collected by 

30 ethanol precipitation, and mixed with equimolar amounts of 
an act in promoter- lucif erase expression construct prior to 
.transfection into HL2/3 cells (Ciminale et al . , 1990). This 
cell line expresses high levels of the HIV-l proteins Gag, 
Env, Tat, Rev, and Nef proteins and affords a good 

35 .simulation of HIV-l transcription in infected T cells; it is 
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especially important to identify CpG sites that suppress 
transcription when methylated and in the presence of the 
virus-encoded stimulatory factor Tat. The hemimethylated 
sites introduced into the constructs are converted to 
5 symmetrically methylated sites immediately upon transfection 
(Busslinger et al., 1983). The ratio of CAT activity to lu- 
ciferase activity (both are measured by simple and well-es- 
tablished assays) are taken as the measure of inhibition by 
CpG methylation. Control constructs are either completely 

10 unmethylated or completely methylated (by extension of the 
primer in the presence of dm 5 CTP) . The location of the 11 
CpG sites and the method of synthesis of the methylated 
expression constructs is shown in Figure 5. The plasmid 
pUC-BENN-CAT and the cell line HL2/3 are distributed by the 

15 NIH AIDS Research and Reference Reagent Program, and have 
been obtained from that source . This experiment is 
straightforward, and the results are invaluable for the 
experiments described in Examples 3, 4, and 5. The assays 
are done both in transiently transfected cells, and in cell 

2 0 clones that have stably integrated the construct; these 
clones are established by G418 selection after unlinked co- 
transformation with pSV2Neo and the pUC-BENN-CAT constructs. 

Example 3. Design, selection, and affinity maturation of 
25 zinc finger -DNA methyltransf erase chimeras that me thy late 
critical CpG sites in the HIV 5' LTR. There are three 
important factors to be taken into account when selecting a 
site for targeted methylation in the HIV-1 5' LTR. First, 
the site must have the ability to repress transcription when 
30 methylated; this is established in the, experiments of 
Example 2. Second, the binding site for the zinc finger 
moiety must be adjacent to (rather than within) the 
recognition sequence of a regulatory factor. The factors 
that bind to the 5' LTR are all host-encoded and are 
35 involved in the —transcription of many cellular genes; 
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methylation of such sites is expected to cause unacceptable 
toxicity through inactivation of vital cellular genes. 
Third, the recognition sequence is preferably from 6 to 25 
base pairs in length and more preferably at least 9 base 
5 pairs in length, because such sequences occur by chance only 
every 4 9 (or 262,144) base pairs. This rarity ensures that 
target sites do not occur by chance in significant numbers 
of cellular promoters. 

10 Zinc finger proteins that meet the above criteria are 
selected from combinatorial expression libraries by phage 
display, and fused to the CpG- specific DNA methyltransf erase 
M. Sssl as described in Example 1, and depicted in Figure 9. 
Rounds of cyclic in vivo/in vitro selection (also described 

15 in Example 1) are used to obtain variants that methylate 
only CpG sites adjacent to the binding site of the zinc 
finger moiety. As before, more than one iteration of the 
mutagenesis/selection procedure may be required, but the 
process has no difficult or time-consuming steps. Only a 

2 0 few days are required for each iteration, and large numbers 

of clones (>10 9 per cycle) can be subjected to selection at 
once . 

The method for the selection of tridactyl zinc finger 
25 proteins that bind to predetermined sequences is shown in 
Figure 9 . 

The method depicted in Figure 9 is used to select a 
tridactyl finger protein that binds to a 9 bp sequence a:dja- 

3 0 cent to a critical CpG site (identified in Example 2) . The 

spacing between the recognition sequence and the target CpG 
site may be adjusted to conform to the sum of the estimated 
radii of the two globular proteins of masses 30,000 (M.SssI) 
and 20,000 (the tridactyl zinc finger protein). ^This 
35 spacing is equivalent to roughly 25 base pairs. 
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The selected finger protein is fused to M.SssI 
methyltransf erase via a random linker to create a second 
combinatorial expression library. From this point forward 
the protocol is very similar to that described in Example 1. 
5 The construct differs from that shown in Figure 6 only in 
that LexA is replaced with the coding region for the zinc 
finger protein, and the LexA binding site is replaced with 
the HIV-1 LTR sequence used for selection. The c-MYC 
epitope tag is included to allow visualization of the 

10 protein on immunoblots and by immunofluorescence. The 
sequence and spacing of the central CpG in the Smal site and 
the recognition sequence for the zinc finger protein is the 
same as that in the HIV-1 LTR target sequence; a minimal 
change that converts the sequences flanking the CpG site to 

15 a Smal site is introduced. This change should not pose any 
problems, as the intrinsic sequence specificity of M.SssI is 
limited to the CpG dinucleotide (Renbaum and Razin, 1993) . 
As was previously done with the LexA construct, cyclic in 
vivo/in vitro selection is used to obtain mutant chimeras in 

2 0 which the DNA methyltransf erase moiety depends on the finger 

protein part of the energy of DNA binding. This obviates 
methylation of collateral CpG sites, which is known to be 
toxic to mammalian cells. As with the LexA construct of 
Example 1, it may be confirmed that Smal resistance is due 
25 to methylation of the target site, and not to loss of the 
site by mutation, by testing for resistance to Smal and 
sensitivity to the methylation- insensitive isoschizomer 
Xmal . Constructs that pass this test are examined for non- 
specific methylation by testing for sensitivity to the 

3 0 methylation-sensitive endonuclease BstUl (CGCG) , Hpall 

(CCGG) , Maell (ACGT) , Hhal (GCGC) . Almost 100 CpG sites 
within the plasmid construct may be tested through use of 
these enzymes. Resistance and sensitivity is assessed from 
banding patterns after endonuclease treatment and agarose 
35 gel electrophoresis. ..As in Example 1, it may be concluded 
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that targeted methylation has been achieved upon recovery of 
plasrnids that are resistant to cleavage by Smal and 
sensitive to Xmal, BstUl, Hhal, Hpall, and Maell. 

5 Alternatively, as with the LexA construct, constructs in 
which the order of the zinc finger and M.SssI moieties has 
been reversed are constructed and subjected to the selection 
protocol described above. As in the case of the pLS 

constructs described in Example 1, the data indicate that it 
10 is possible to screen between 10 9 and 10 10 independent clones 
at once, and each iteration of the selection protocol 
requires only a few days . 

O Example 4. Inhibition of HIV-1 5' LTR -dependent 

g 15 transcription in cultured human cells that express zinc 
|rj finger/DNA methyl transferase fusion proteins of novel and 

H predetermined specificity. Example 3 yields constructs that 

£ methylate critical sites in the HIV-1 5' LTR. These 

W constructs are introduced into cultured human cells that 

^ 20 express a reporter gene from the HIV-1 5' LTR as a means of 
Q testing the inhibitory effects of targeted de novo me thy - 

y lation. j 

tt The plasmid pUC-BENN-CAT (Gendelman et al . , 1986 ) is 

25 introduced into HL2/3 cells (Ciminale, 1990) together with 
unlinked pSV2Neo, and stable transf ectants selected with 
G418. The HL2/3 cell line expresses most of the HIV-1 
accessory proteins, (including Tat) from a replication- 
defective HIV-1 provirus (Ciminale et al . , 1990). The 
3 0 presence of HIV-1 viral accessory proteins renders this 
assay a good simulation of HIV-1 transcription in infected 
cells. Transf ectant clones that express the CAT gene from 
the HIV-1 LTR are identified by RNA blot hybridization. 
Such cell clones are referred to as HL2/3 -LTR-CAT . 
35 :; 
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HL2 / 3 - LTR - CAT cells are transfected by the calcium phosphate 
technique with a mammalian expression vector that directs 
production of the zinc f inger-M. SasI fusion protein (Example 
3) from the cytomegalovirus (CMV) immediate early enhancer- 

5 promoter. The starting vector is pEVRFO (Matthias et al . , 
1990) , which contains the CMV promoter, a beta globin intron 
in the 3' untranslated region for enhanced transcript 
stability, and a polylinker region for convenient cloning. 
We have used this vector with good success in many 

0 constructs (Czank et al . , 1991; Leonhardt et al. ; 1992). 
This expression construct is termed pCMV-ZMet. A pSV2Hyg 
resistance marker is co-transf ected with pCMV-ZMet, and the 
cells subjected to double selection with G418 and 
Hygromycin. Production of zinc f inger/M. SssI fusion protein 

5 are assayed by immunoblot analysis; as with the LexA-M. SssI 
fusion protein of Figure 3, an N-terminal c-MYC epitope tag 
is present, and the protein is visualized with the 9E10 
antibody. 

0 Immunofluorescence microscopy is used to determine whether 
the methylating agent can gain access to nuclei. If it is 
found to be confined to the cytoplasm, the nuclear 
localization signal of SV40 large T antigen may be added to 
the N- terminus, as we have done for a class of fusion pro- 

5 teins to induce their translocation into nuclei (Leonhardt 
et al. , 1992) . 

Cells that express zinc f inger/M. Sssl fusion protein are 
examined for loss of CAT expression by CAT enzyme assay and 

0 by RNA blot hybridization. In the event that the most 
critical CpG site happens to lie within one of the four 
sites for methylation- sensitive restriction endonuclease 
(four of the eleven sites can be assayed in this way; see 
Figure 1), methylation may be assayed by cleavage of DNA 

5 with that enzyme followed by DNA blot hybridization. This 
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is a simple and long- established technique for determination 
of the methylation status of individual CpG sites. If the 
critical site chosen for targeted methylation does not lie 
within such a restriction site, the genomic sequencing 
5 method of Grigg and Clarky (1994) may be used to identify 
methylated cytosines. The method of Collins and Myers 
(1987) can also be used to detect methylation differences. 
This method exploits the increased melting temperature of 
methylated DNA, which allows methylated and unmethylated 
10 fragments to be resolved by denaturing gradient gel 
electrophoresis (Collins and Myers, 1987) . Both methods are 
well-established and both will allow unambiguous 
localization of methylated sites in HIV-1 LTR sequences. 

15 The embodiments described herein of this invention indicate 
that expression of a finger protein/M. Ssel chimera targeted 
to a critical CpG site in the HIV-1 LTR methylates and 
inactivates expression of a reporter gene that is driven by 
the LTR. 

20 ■- 

Example 5* Inhibition of HIV-1 replication in T lymphocytes 
productively infected with HIV-1. Jurkat E6-1 cells are 
electroporated : with infectious molecular clones of .' HIV-1 
strain NL4-3, and a productive infection established. The 

25 HIV-producing cells . are transf ected with ■< the finger 
protein/M. Sssl -construct described in the previous section, 
together with pSV2Neo as a selectable marker . . A vector, con- 
trol and a finger protein-only control may be used. This 
latter control is necessary to distinguish inhibition ,via 

3 0 methylation from that resulting from direct binding of the 
finger protein to its target site, as has happened in .one 
other case (Choo et al., 1995). The finger protein/M. Sssl 
chimera is designed to cause de novo methylation and 
transcriptional repression of HIV-1 proviral DNA. 

35 Methylation is assayed as described in the previous section, 
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and viral transcription is assayed by RNA blot hy- 
bridization. 

The plasmid-mediated system test described above is 
5 convenient and reliable but does not simulate conditions 
that may be encountered in a therapeutic application. A 
vector that addresses the needs of therapeutic applications 
is therefore used and described below. One of the most 
promising viral vectors is based on adeno-associated virus 
10 (AAV) , and the methylating agent may be delivered through 
use of recombinant AAV. 

AAV has several advantages as a transducing vector: I. AAV 
is not associated with any disease, ii. The viral DNA is 

15 thought to integrate at a unique site on chromosome 19q, 
with little risk of insertional mutagenesis, iii. it has 
been reported that AAV can infect non-replicating cells 
(reviewed in Berns and Linden, 1995) , iv. AAV can infect 
lymphoid cells (Mendelson et al . , 1992), and transcription 

20 units borne by recombinant AAV strains are expressed in T 
lymphocytes (Chatterjee et al . , 1992), and v. Virulent 
strains of AAV cannot arise by recombination within infected 
cells, as can occur with retroviral vectors. -.„-.-;. 

25 Introduction of methylating agents into transducing AAV , 
vectors . is straightforward. The finger protein/DNA 

methyltransf erase coding region under control of the CMV 
immediate early enhancer-promoter is amplified by PCR with 
primers that introduce Xbal sites at both ends of the 

3 0 construct. This expression cassette is ligated between the 
Xbal sites of pAVl, an infectious molecular clone of AAV 
(Banerjee et al., 1992). The resulting clones are selected 
and propagated in E. coli, and confirmed by direct 
sequencing; clones of the desired specificity are termed 

35 pAV-ZMet. Control clones that express only the zinc finger 
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protein are constructed and termed pAV-Zif; clones that 
express only the mutant M.SssI are constructed and termed 
pAV-Met. Infectious recombinant virus stocks are obtained 
by co-transf ection of adenovirus -infected HeLa cells with 

5 pAV-ZMet, pAV-Zif, or pAV-Met, together with pTAAV, a 
plasmid that encodes proteins required for replication and 
packaging (Banerjee et al., 1992). The titer of virus 
stocks obtained in this manner are commonly 10 7 ml 1 
(Banerjee et al., 1992); such viruses are also capable of 

0 infecting CD4 + T lymphocytes such as Jurkat cell with high 
efficiency (Mendelson et al . , 1992). The recombinant 
viruses are termed AAV-ZMet, AAV-Zif, and AAV-Met, in 
keeping with the terminology of their plasmid precursors. 

5 Jurkat cells productively infected with HIV-1 strain NL4-3 
(as described above) are infected with AAV-ZMet, AAV-Zif , or 
AAV-Met at an MOI that infects >90% of the cell population, 
as established by immunofluorescence microscopy with 
antibodies against the c-MYC epitope encoded by the fusion 

0 proteins. Jurkat cells are highly susceptible to infection 
by AAV (Mendelson et al . , 1992) RNA blot hybridization is 
.-used to evaluate the effect of expression on transcript 
:u accumulation, . as has -been done in antisense inhibition 
experiments that utilized AAV expression vectors (Banerjee 

5 et al., 1992). The methylation status of integrated HIV-1. 
proviral DNA may be evaluated as described in Example 3. 
Comparison iof transcript levels, cell survival, and proviral 
- methylation among cultures infected with vector control, 
AAV-ZMet, AAV-Zif, and AAV-Met can give a clear indication 

0 of the efficacy of targeted methylation in suppressing HIV-1 
expression under conditions that approximate those to be 
encountered in actual therapeutic applications. 

Example €• Design, selection, and affinity maturation of 
5 zinc finger-DNA methyltransf erase chimeras that methylate 
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critical CpG sit s in a promot r of th hepatitis B virus. 

Chronic hepatitis due to infection with Hepatitis B virus 
(HBV) is a major cause of cirrhosis of the liver and 
5 hepatocellular carcinoma (reviewed by Ganem and Varmus , 
1987) . Approximately 1.5% of the population is infected 
with HBV (reviewed by Saracco and Rezzetto, 1995) and the 
worldwide morbidity and mortality associated with chronic 
HBV infection is far greater than that due to HIV-1 

10 infection. As shown in Figure 10, the HBV genome is rich in 
CpG sites (there are 103 in a genome of only 3.2 kb; Ono et 
al., 1983) and transcription from HBV promoters is sensitive 
to CpG methylation (Miller and Robinson, 1983; Pourcel et 
al., 1990; The contents of these references are incorporated 

15 in their entirety into the subject application) . HBV 
proviral DNA is therefore susceptible to inactivation by 
targeted methylation according to the methods elaborated for 
HIV-1 inactivation herein. 

20 Identification of critical CpG sites, design of chimeric 
zinc finger protein/DNA methyltransf erases , and cyclic in 
vivo/in vitro selection of chimeras that methylate only the 
target CpG site is as described for the HIV-1 targets in the 
above examples. . ■ . * 

25 _■■ 

One embodiment of the subject invention is to target the 
known promoters of the hepatitis B virus and inactivate the 
viral machinery via methylation. The site for targeted 
methylation in the hepatitis B promoter is selected as 

30 follows. First, the site will have the ability to repress 
transcription when methylated; this is established in the 
experiments of Example 2 above. Second, the binding site 
for the zinc finger moiety will be adjacent to (rather than 
within) the recognition sequence of a regulatory factor. 

35 The factors that bind to the promoter of the viral X protein 
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or the pre-S2/5 promoter are host-encoded and are involved 
in the transcription of many cellular genes; methylation of 
such sites is expected to cause unacceptable toxicity 
through inactivation of vital cellular genes. Third, the 
5 recognition sequence is optimally from 6 to 25 base pairs in 
length and more preferably at least 9 base pairs in length. 



Zinc finger proteins that meet the above criteria are 
selected from combinatorial expression libraries by phage 

10 display, and fused to the CpG-specific DNA methyltransf erase 
M.Sssl as described in Example 1, and depicted in Figure 9. 
Rounds of cyclic in vivo/in vitro selection (also described 
in Example 1) are used to obtain variants that methylate 
only CpG sites adjacent to the binding site of the zinc 

15 finger moiety. As before, more than one iteration of the 
mutagenesis/selection procedure may be required, but the 
process has no difficult or time-consuming steps. Only a 
few days are required for each iteration, and large numbers 
of clones (>10 9 per cycle) can be subjected to selection at 

2 0 once. 



p, The method for the selection of tridactyl zinc finger 

W proteins that bind to predetermined sequences is shown in 

a 8 Figure 9 . . 

O 25 

y The method depicted in Figure 9 is used to select a 

tridactyl finger protein that binds to a 9 bp sequence adja- 
cent to a critical CpG site in either the promoter of viral 
protein X or the pre-S2/5 promoter of hepatitis B. The 
3 0 spacing between the recognition sequence and the target CpG 
site may be adjusted to conform to the sum of the estimated 
radii of the two globular proteins of masses 30,000 (M.SssI) 
and 20,000 (the tridactyl zinc finger protein). This 
spacing is equivalent to roughly 25 base pairs. 



m 



35 
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Figure 11 A and Figure 11B illustrate targeted methylation 
of a preselected CpG site in a pLS vector which encodes a 
lexA-AT. SssI fusion protein. This data proves that a CpG- 
specific DNA methyltransf erase can be directed to a target 
5 site that is adjacent to the binding site of a sequence- 
specific DNA binding protein. This targeted methylation was 
achieved in living bacterial cells . Mutagenesis and cyclic 
in vivo/in vitro selection is being applied so as to obtain 
a chimeric protein that will methylate the target site but 
10 have no effect on collateral CpG sites. The data shown 
above and specifically in Figures 11A-11B, demonstrate that 
the principle of targeted methylation has been reduced to 
practice in the laboratory. 

15 Example 7: Targeted Methylation in the Repression of HIV-1 
Replication 

A. Construction of chimeric DNA methyl transferase of novel 
and predetermined specificity. 

20 

The Lac repressor was fused to the N- terminus of M.SssI by 
recombinant DNA technology and the chimeric, protein was 
expressed in coli. The construct used in these 

experiments is, shown in detail in Figure 12. The construct 

25 also contained a synthetic sequence that included the Lac 
repressor binding site and a cluster of CpG dinucleotides . 
This construct was used to establish three important points : 
First, that chimeric DNA methyl transferase could be produced 
in bacterial cells in a stable and soluble form; second, 

3 0 that the chimeric DNA methyltransf erase would selectively 
methylate CpG sites in the vicinity of the recognition 
sequence of the lac repressor, and third, the optimal 
spacing between recognition sequence and target CpG site is 
20-3 0 nucleotides. 

35 
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The construct shown in Figure 12 was electroporateci into E . 
coli and synthesis of fusion protein induced by addition of 
isopropylthiogalactoside to culture medium. After 3 hours 
of incubation cells were lysed, and protein extracts were 

5 subjected to SDS gel electrophoresis and immunoblot analysis 
. with an anti-Lac repressor antibody. As shown in Figure 13 , 
the chimeric DNA methyltransf erase was soluble and resistant 
to proteolysis as shown by its appearance as a single band 
of the appropriate size. There was little sign of a 

0 reduction in growth rate upon induction of synthesis of the 
chimeric DNA methyltransf erase . This confirmed that the 
chimeric DNA methyltransf erase was not toxic to the host 
cells . 



5 Plasmid DNA was purified from the same cells and high 
resolution mapping of methylated sites was carried out by 
the bisulfate genomic sequencing method. Bisulfite converts 
cytosine to uracil by oxidative deamination. 5- 
methylcytosine is resistant to bisulfite attack. Upon PCR 

0 amplification all cytosine will be copied as thymidines but 
. 5-methylcytosine will be preserved as cytosines. As shown 
in Figure 14, plasmid DNA exposed in vivo to Lac repressor- 
M.SssI fusion proteins underwent methylation at CpG sites 
. near the Lac repressor binding site; vector DNA was not 

5 methylated, and plasmid exposed to M.SssI became methylated 
at all CpG sites. These data confirm that fusing a DNA 
methyltransferase to a sequence-specific DNA binding protein 
can limit methylation to CpG sites in the vicinity of the 
recognition sequence of the DNA binding protein as 

0 originally proposed. 

Figure 14 also shows that targeted methylation occurred 20- 
30 nucleotides away from the 3' border of the Lac repressor 
binding site. This is taken to reflect steric interference 
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between protein moieties. The result was expected and is 
not a limitation of the method. 



The Lac repressor-M. SssI fusion proteins confirmed the 
5 principle of targeted methylation. Figure 15 depicts a new 
class of chimeric DNA methyltransf erases that have been 
constructed and confirmed by DNA sequencing. The sequence - 
specific DNA binding moiety is a three-finger zinc finger 
protein (Zif268) of known binding specificity. These agents 
.0 are being subjected to the same characterization as the Lac 
repressor-M. SssI fusions. They have also been cloned into 
mammalian expression vectors and experiments in cultured 
primate cells are being performed. 

5 In summary, chimeric DNA methyltransf erase-DNA binding 
proteins have been shown to selectively methylated CpG sites 
in the vicinity of the recognition sequence of the DNA 
binding protein moiety. 

0 Inactivation of HIV-1 by CpG methylation. 

As shown in Figure 16, the LTR of HIV-1 has been cloned 
upstream of two reporter genes, chloramphenicol 
acetyltransf erase (CAT) and firefly lucif erase (Luc). Both 
constructs drive high-level expression of the CAT of Luc 
5 . reporter genes when transfected into tat-expressing HL2/3 or 
HLTAT human cervical carcinoma cells . 

Methylation is known to suppress transcription from most 
promoters but its effect on HIV-1 LTR-driven transcription 

0 has not been documented in a system that resembles the 
infected cells. Figure 17 shows that CpG methylation of 9 
CpG sites in the HIV-1 LTR reduces transcription to near 
background levels; the effect is at least 2 0 fold. The 
effect appears to be irreversible and does not require 

5 continued application or expression of any foreign agents. 
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CpG methylation can therefore be regarded as the most 
powerful way to suppress the activity of HIV-1. Work is 
underway that will identify the CpG sites that mediate the 
biological effects of methylation. 

5 

Work as described herein, has shown that M.SssI does not 
methylate single stranded DNA. This has led to the 
development of a new method for the region- specif ic 
methylation of a DNA fragment. A single -stranded 

0 oligonucleotide or longer fragment synthesized by asymmetric 
PGR is annealed to a single- stranded circular DNA and 
treated with M.SssI. CpG sites in the double stranded 
region of the plasmid are methylated while all single 
stranded regions are unmodified. This new method reduces 

5 the time required to identify critical CpG sites. The 
regional methylation method is being used to evaluate the 
effects of CpG methylation on the HIV-1 LTR versus the gag 
leader sequence; this latter sequence is rich and has 
strongly conserved CpG sites (Figure 16) and methylation of 

0 the sites may suppress transcription of HIV-1 as well or 
better than methylation of LTR sequences. 
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